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ABSTRACT 

We use the H I-selected galaxy sample from the Arecibo Dual-Beam Survey (Rosenberg & Schneider 
2000) to determine the shape of the H I mass function of galaxies in the local universe using both 
the step- wise maximum likelihood and the X/Vtot methods. Our survey region spanned all 24 hours 
of right ascension at selected declinations between 8° and 29° covering ^430 deg 2 of sky in the main 
beam. The survey is not as deep as some previous Arecibo surveys, but it has a larger total search 
volume and samples a much larger area of the sky. We conducted extensive tests on all aspects of the 
galaxy detection process, allowing us to empirically correct for our sensitivity limits, unlike the previous 
surveys. The mass function for the entire sample is quite steep, with a power-law slope of a w —1.5. 
We find indications that the slope of the H I mass function is flatter near the Virgo cluster, suggesting 
that evolutionary effects in high density environments may alter the shape of the H I mass function. 
These evolutionary effects may help to explain differences in the H I mass function derived by different 
groups. We are sensitive to the most massive sources (log M > 5 x 10 10 M Q ) over most of the declination 
range, ~ 1 sr, and do not detect any massive low surface brightness galaxies. These statistics restrict 
the population of Malin 1-like galaxies to < 5.5 x 10 -6 Mpc~ 3 . 

Subject headings: galaxies: mass function — radio lines: galaxies 



1. INTRODUCTION 

One of the main motivations for the Arecibo Dual-Beam 
Survey (ADBS, Rosenberg & Schneider 2000, hereafter Pa- 
per 1) was to determine the shape of the H I mass function 
and, in particular, to determine the amount of mass tied 
up in low H I-mass galaxies. Our data indicate that the 
H I mass function is quite steep down to our effective sen- 
sitivity limit of about 3 x 10 7 A/ Q . As parameterized by a 
Schcchter function, we find a power law slope of a = —1.5. 

The faint end slope of the H I mass function has been 
the focus of considerable controversy, resulting in uncer- 
tainty about the fraction of the overall hydrogen bud- 
get contributed by low-mass galaxies. We suggest that 
the differences found between different groups are at least 
partially caused by environmental influences, with ram- 
pressure stripping and merging in high-density environ- 
ments resulting in the depletion of low mass H I sources. 

Most of the shallower slopes (a w —1.2) for the H I mass 
function have been derived from optically-selected samples 
in the field (Huchtmcicr 2000, Briggs & Rao 1993), or in 
clusters (Briggs & Rao 1993) or H I-selected samples in 
high density regions like the Canes Venatici group (Kraan- 
Korteweg et al. 1999), Centaurus A (Banks et al. 1999), or 
Ursa Major (Verheijen 2000). The results for the faint end 
slope in the field have been more varied. Some H I-selected 
samples (Zwaan et al. 1997) have also suggested a slope of 
a w —1.2, while other studies indicate that the slope might 
be steeper. Early Parkes survey results suggest a slope of 
a « -1.5 (Henning et al. 2000; Kilborn et al. 2001), 
and our analysis of two earlier Arecibo surveys (Schnei- 
der, Spitzak, & Rosenberg 1998, hereafter SSR) suggested 



a steep faint-end rise similar to that of some optical field 
galaxy samples (Loveday 1997; Driver & Phillipps 1996). 

In addition to the effects of environment, we believe a 
significant source of discrepancies between surveys arise 
from differences in the analysis methodology and the de- 
termination of sensitivity limits. One of the most impor- 
tant innovations of the ADBS is the introduction of "syn- 
thetic" sources that were carried through the entire data 
reduction stream in order to accurately characterize our 
recovery rate (Paper 1). Most previous surveys have relied 
on blanket claims of N-a sensitivity without demonstrat- 
ing their completeness at the quoted level. In fact, most 
samples we have examined fail V/V ma x completeness tests 
(SSR; Schneider & Schombert 2000). 

The data for this analysis are derived from the Arecibo 
Dual-Beam Survey (see Paper 1 for survey details). This 
is a "blind" H I driftscan survey that covers ~ 430 deg 2 in 
the main beam. The final galaxy tally is 265 sources in the 
velocity range -654 to 7977 kms -1 . These 265 galaxies 
represent all of the sources detected in by-eye examina- 
tions of the Arecibo data that were reconfirmed at Arecibo 
and/or the VLA. Seven of the galaxies have Mhi < 10 8 
Mq , almost as many low mass sources as found in all ear- 
lier blind surveys combined (we use Ho = 75 kms -1 Mpc -3 
throughout). The average rms noise of the survey spectra 
is 3.5 mJy so a 10 8 M source with a 50 kms -1 veloc- 
ity width would be a 5-ct detection at 22 Mpc (1650 
kms -1 ). 

In §2 we describe the survey sensitivity and the relation- 
ship between a galaxy's HI mass and its observed flux as 
a function of position, frequency, and distance. In §3 we 
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describe the probability of detecting a galaxy as a func- 
tion of the profile linewidth and the observed flux. This 
section makes use of the "synthetic" sources inserted in 
the data to determine these probabilities. In §4 we derive 
the field mass function for the ADBS using two standard 
techniques: the stepwise maximum likelihood (SWML) 
and the l/Vtot methods. We also examine whether the 
mass function might be significantly altered by different 
assumptions like varying our minimum velocity cutoff, our 
distance determination method, or the shape of the com- 
pleteness function. In §5 we discuss the influence of galaxy 
density on the shape of the H I mass function, and show 
how the mass function might be affected by making cor- 
rections for large scale structure (in the l/Vtot method). 
In §6 we examine the limits that the survey places on the 
population of galaxies with extremely high H I masses, and 
in §7 we summarize our results. 

2. SURVEY SENSITIVITY 

The interpretation of H I surveys requires accounting 
for variations in sensitivity as a function of source position 
and redshift. In addition, all current blind H I surveys are 
only sensitive to low mass sources at low redshifts, which 
can potentially introduce large errors in nearby source dis- 
tances. Optical surveys have similar issues — caused by vi- 
gnetting, K-corrections, and magnitude limits — but these 
are generally much less important. 

Whenever possible, we have established empirical rela- 
tionships to describe our survey's sensitivity and/or ap- 
plied alternate methods to test the robustness of our re- 
sults. In the following sections we describe the noise levels 
for our spectra throughout the survey (§2.1), and then 
study the relationship between a galaxy's HI mass and its 
observed flux as a function of position (§2.2), frequency 
(§2.3), and distance (§2.4). 

2.1. Baseline Noise and Coverage 

The average rms sensitivity for individual spectra, af- 
ter Hanning smoothing to a resolution of 32 kms -1 , was 
a i = 3.5 mjy. The variation around this value was small 
except for occasional episodes of heavy broadband inter- 
ference which occurred in about 2% of our observations. 
Since none of our 265 sources were detected during these 
high-noise episodes, we have eliminated them from further 
consideration. 

Over 31% of the area observed, the ADBS scanned the 
position only once, the remainder was covered at least 
twice. We did not coadd spectra where they overlapped 
because there were occasionally slight differences in the 
data-taking rate, brief gaps associated with data dumps, 
and episodes of broadband interference. All of these small 
differences would have left us with a data cube with very 
complex noise variations if wc had coadded the data. 
Rather, we examined doubly-covered regions in parallel, 
using the duplication of faint sources to help us more reli- 
ably identify sources. 

Even though the spectra in double-covered regions were 
not coadded, examining the data in parallel allowed us to 
detect fainter sources. This sensitivity improvement is re- 
flected by our detecting 77% of our sources in the 69% 
of the survey area that was double-covered. We estimated 
the improvement in sensitivity by comparing V/V max com- 



pleteness tests (see SSR) for single- and double-covered re- 
gions. We find that double coverage was equivalent to a 
noise reduction by a factor of 1.2, giving an effective rms 
noise for double-covered regions of cr 2 = 2.9 mjy. 

2.2. Position Dependence 

Because the ADBS was a driftscan survey, the sensitiv- 
ity to sources decreased with increasing declination-offset, 
AS, from each feed's fixed declination. The theoretical "in- 
tegrated" beam of the survey, which describes the sensi- 
tivity decrease, is shown in Paper 1. The actual sensitivity 
is more complicated since it depends on the convolution 
of each galaxy's H I distribution with a beam that is not 
precisely azimuthally symmetric and which is sampled at 
intervals that do not integrate each galaxy's entire flux, 
J S dv, as it passes through the beam. We determined the 
galaxies' fluxes (/ S dv) with follow-up, centered observa- 
tions at Arecibo and the VLA, and compared these with 
their originally observed fluxes (J S b s dv) as a function of 
their declination offset. To simplify the notation, we will 
call these integrated fluxes or "signals" S and S b s respec- 
tively. We found that the empirical relationship between 
the two was well fit by the function: 

f (AS) = 1 + 0.28 -abs(AS) l s (1) 

Sobs 

for AS in arcmin. The first sidelobe is at A<5 ~ 4.4' (see 
Paper 1) where / drops to about 0.2. Beyond AS = 4.4' 
the correction factor is more uncertain, but we detected 

20 sources with offsets up to A<5 = 12'. The galaxies with 
large offsets were all high mass galaxies with extended H I 
distributions and had mean detection fluxes ~10% of their 
remeasured values. These 20 sources are not included in 
the H I mass function calculations. 

2.3. Frequency Dependence 

The two line-feed receiver systems used in the ADBS 
each had variable gain depending on the redshifted fre- 
quency of the H I signal. We determined the gain, g(v), 
across the bandpass by examining continuum sources that 
fell within the survey region, and (following standard prac- 
tice at the observatory) fitted the gain variations with a 
Gaussian function of frequency: 

g(v) = 2-[2^-^-)/ 52 ] 2 (2) 

where v cen is the center frequency for the feed. For the 

21 cm feed we determined this to be 1408.5 MHz; for the 

22 cm feed, 1398.5 MHz. Both feeds had a half-power 
frequency response width of 52 MHz. 

We determined the statistics for single and double sky 
coverage by the 21 and 22 cm feeds separately because of 
the differences in the feed responses. The 22 cm feed was 
more often affected by the occasional broadband interfer- 
ence and had a lower gain at the small redshifts so it was 
less sensitive to the lowest-mass sources, which can only 
be detected nearby. 

2.4. Distance Estimates 

The ADBS search volume includes the Virgo Cluster. 
This not only introduces possible problems with respect 
to variations in the shape of the mass function because 
of the environment, but also causes large uncertainties in 
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the redshift/distance relationship. We adopt the Tonry et 
al. (2000) flow model to derive distances from our mea- 
sured redshifts, and assumed a Hubble constant of H = 75 
kms -1 Mpc" 1 . We tested the sensitivity of our results to 
the choice of flow model by also carrying out the calcu- 
lations using a simple distance estimate based on Vq, the 
heliocentric velocity corrected by 300 cos(6) sin(Z) for Local 
Group motion (de Vaucouleurs et al. 1977; also see §4.3). 

The flow correction model reduces distance uncertain- 
ties, but within 6° of the center of the cluster large un- 
certainties are impossible to avoid so we handle this re- 
gion separately in our analysis (§5.2). We believe that 
the Tonry model provides as effective a distance correc- 
tion as is available, but it is important to note that all 
current blind H I surveys suffer from detecting low mass 
sources at small redshifts where the distance uncertainties 
are large. It will take a much deeper survey covering a 
large volume to improve this situation. 

Given the distance of each source we can compute its 
mass in solar masses from: 

M H i = 2.36 x 10 5 D 2 S, (3) 

(Roberts 1975) where D is the distance in Mpc. Equiv- 
alently, we can predict the expected noise-free detection 
signal strength from a source located at any position in 
our survey volume as: 

_ 6 g(v)M HI 



S det = 4.24 x 10" 



(4) 



Note that since we have an implicit relationship between 
D and v, the detection signal strength can be predicted 
from the source mass, distance, and declination offset. 

The distinction we make between S a bs and Sdet is that 
the former includes noise and may suffer from peculiarities 
due to the specific observational conditions, like asymme- 
tries in the beam shape. There are several other small 
effects that we have ignored: 

(1) Variations with declination: the sky moves through 
the telescope beam by up to 5% faster or slower at the 
southernmost and northernmost of our declination strips. 
This actually has an even smaller percentage effect on the 
maximum observed signal strength because the galaxies 
dwell within the beam for several sampling intervals. 

(2) Variations with zenith angle: Because of the Arecibo 
telescope's design, there are variations with angle from the 
zenith. Since all of our observations were made within 11° 
of the zenith where there was very little "spillover," these 
variations are less than a few percent. 

(3) Variations with source size: If a source is larger than 
the beam size, the measured signal will be smaller than we 
predict. This proves to be unimportant for our results, be- 
cause all sources that "fill the beam" have strong signals. 
The few sources for which there is a more accurate flux in 
the literature have had those values substituted (see Pa- 
per 1). This is effectively a statement that there are no 
galaxies with such low H I surface densities that long in- 
tegrations would be required to detect them, which was 
shown by the deeper survey of Zwaan et al. (1997). Note 
that this is not necessarily the case for synthesis observa- 
tions where the beam size is much smaller, and the H I 
surface brightness sensitivity generally poorer. 



The adjustments we have applied are just for the effects 
that result in at least tens of percent change. 

3. SENSITIVITY AND COMPLETENESS 

The preceding section described how the signal strength 
and noise levels vary with location in the survey search vol- 
ume. We turn next to the probability of detecting an H I 
signal of a particular linewidth w and integrated flux Sdet 
in a spectrum with an rms noise a. 

Previous surveys have generally attempted to make 
plausible assumptions about some signal level to which 
they believe they should be complete, but we have found 
(SSR; Schneider & Schombert 1999) that such claims do 
not pass muster with completeness tests. Since all H I 
surveys to date have relied to varying degrees on human- 
eye inspection of data, we believe it is essential to build 
into the detection step a means of assessing the survey 
completeness, C, empirically 

Our approach to determining the ADBS complete- 
ness was to insert a large number of "synthetic" sources 
throughout the survey data (see Paper 1). The synthetic 
profiles were modeled to look very similar to observed H I 
profiles, and were inserted early in the data-processing pro- 
cedure so that they would be treated like real sources, suf- 
fering, for example, the effects of automated baselining 
procedures. The synthetic sources were given randomized 
positions, line widths, and line strengths, and their loca- 
tions within the data stream were unknown to us during 
the detection steps. By determining the rate at which we 
recover synthetic sources of a particular line width and sig- 
nal strength, we can empirically establish the probability 
of detecting sources with different linewidths and signal 
strengths. 

We find that the shape of the completeness function is 
basically the same for different line widths w, up to some 
factor in the signal strength Sdet- We can write: 



C(w,S detl a) = C 



>det 



Afeff(w) 



(5) 



where Af e f f(w) is an effective noise level determined empir- 
ically for different line widths. Note that synthetic sources 
require no corrections for declination offset or gain correc- 
tions so that we know the input, noise- free value of the 
detection flux. In §3.1 we explain the derivation of the 
line width dependence and in §3.2 the variations of com- 
pleteness with signal strength. 

3.1. Line Width Dependence 

H I surveys have sensitivity variations depending on the 
velocity width of the spectral line. The same total signal 
S spread over a larger frequency width, has a lower mean 
flux density and is harder to detect. 2 

The empirical line-width dependence is slightly different 
than would be predicted from basic statistical arguments, 
but these differences do not have a strong effect on the fi- 
nal determination of the mass function as we show in later 
sections. A statistical model of the noise dependence on 
line width J\f(w) is that it grows as the root sum square 
of the uncorrelated noise a in individual channels. In this 

2 Note that there are related problems for optical surveys that behave in the opposite direction — face-on disk galaxies have a lower surface 
brightness and are therefore harder to detect than more edgc-on disks, whereas their H I line width is narrower, so they are easier to detect. 



4 



Rosenberg and Schneider 



theoretical case, the noise should grow as w 5 . However, 
in our analysis of earlier Arecibo surveys (SSR), we found 
that the detectability of wide-line sources showed a more- 
rapid decline with line width than this noise model pre- 
dicts. Such a behavior can be explained as an effect of 
basclining and other data-processing procedures that may 
affect wider line width signals more adversely than narrow 
ones. 

Based on our empirical results for the synthetic sources, 
the signal strength at which the completeness drops to 
50% increases as w - 75 , or equivalently we can describe 
the effective noise Af e ff °c Afw - 25 . We arbitrarily set the 
effective noise value to match the statistical value at a 
lincwidth of 300 kms -1 , yielding: 



eff 



32a{w 2Q /32) 
(300/32) 



0.75 



0.25 



(6) 



where a is the rms noise in Jy, W20 is the line width of the 
detected signal in km s _1 measured at 20% of the peak flux 
density, and 32 refers to our velocity resolution in kms -1 . 
Note that the choice of 300 kms -1 for normalizing Af e ff 
does not in any way affect our results since the determi- 
nation of the completeness C simply uses this to scale the 
fluxes for different linewidths. 

This dependence on line width is in agreement with what 
we found for the surveys of Spitzak & Schneider (1998) 
and Zwaan et al. (1997) based on their V/V ma x statis- 
tics for different line widths (SSR). This suggests that the 
reduced sensitivity to wide-line sources may be a fairly 
generic property of H I surveys. 

The effect of the line width dependence is that sources of 
the same mass are not necessarily detectable to the same 
distances. Therefore, in order to characterize the H I mass 
function, wc need to detect a sufficiently large number of 
sources so that we have a representative sample of the dif- 
ferent characteristics of galaxies in each mass range. 



3.2. The Completeness Function 

We determined the probability of detecting sources — 
the completeness C — by the rate at which we recovered the 
synthetic sources as function of their signal strengths. We 
initially examined different line width ranges separately, 
but the shapes of the curves were similar, so we folded the 
data together, scaled by the effective noise, to empirically 
estimate C(Sd e t/Neff)- Again note that we are studying 
the synthetic sources so we know the input, noise- free value 
of the detection flux. The resulting completeness function 
is shown in Figure 1. 

The figure shows that the edge between detectability 
and non-detectability is not sharp. This is a combination 
of at least two effects: (1) noise added to sources near the 
limit of detectability of the survey may push sources above 
or below the detection threshold; and (2) the interaction 
of a particular line shape with the detection methods, 
whether automated or conducted by-eye, will introduce 
some uncertainty in detections near the nominal "limit." 
The dashed line in the figure is an error function fit to 
the data. An error function is the expected result when 
Gaussian noise falls on top of an underlying signal. 
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Fig. 1. — The relationship between completeness and S^t /Af e f f ■ 
This function was determined using the "synthetic" sources from 
Paper 1. The dashed line is an error function fit to the data. 

We found that the detection fraction never reached 1.0, 
mostly because we would occasionally lose bright sources 
near interference spikes or when the automated baselining 
procedure did not work well (see Paper 1). We found that 
we could not isolate this effect to a few individual frequen- 
cies, so we leave it as an overall correction that will make 
our estimated source densities about 10% higher. 

Because of the characteristics of gaussian noise, any cut- 
off in signal-to-noise inevitably includes some sources be- 
low the cutoff limit and excludes some sources above it. 
One can approximate a step- function sensitivity by detect- 
ing sources to a deep limit and then using high sensitivity 
follow-up observations to eliminate all sources below where 
the roll-off in completeness becomes significant. Unfortu- 
nately, this would necessitate using a very high cutoff level 
(S/Af w 10), and ignoring a large amount of data. 

Another approach might be to choose a lower cutoff, 
like 5- or 7-a, and ignore all sources that prove to be 
weaker than this limit upon follow-up observations. This 
approach will exclude less data than a high-sigma cutoff, 
but it has the disadvantage of uncertain effects due to the 
incompleteness of low- flux sources. 

With knowledge of the completeness function, it is pos- 
sible to retain all detected sources when estimating the 
mass function. We know the fraction of sources detected 
at any signal-to-noise level, so we can correct our esti- 
mates of the volume of space searched. Essentially, where 
a source's signal strength is so weak that our probability of 
detection is half, we have effectively searched half as much 
volume at that distance. 

3.3. Completeness and V/V m ax 

The V/V m ax test (Schmidt 1968) is often used to deter- 
mine whether a survey's sensitivity has been properly de- 
fined. The V/Vmax test assumes a step-function sensitivity 
cutoff and uniformly distributed sources. Given these con- 
ditions, the average of the volumes interior to all sources 
should equal half of the maximum volume within which 
these sources could have been detected: V/V m ax = 0.5. 

If we allow for a roll-off in completeness, some sources 
will be detected that are actually weaker than whatever 
nominal sensitivity limit is chosen, and they will have 
V/Vmax > f • It is no longer clear what value of V/Vmax 
should result from such a sample, or whether the test is 
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applicable. 

Actually, it is straightforward to calculate the expected 
mean value of V /V max given the completeness function. 
We carried out a simple Monte-Carlo calculation for 10,000 
uniformly-distributed sources, randomly selected accord- 
ing to the value of the completeness function for their pre- 
dicted fluxes at their assigned distances. Defining V max in 
terms of the limiting distance for a source with a flux at 
the 50% completeness level, we expect V /Vmax = 0.61. 

The detected ADBS sources have V/V ma x = 0.60 when 
measured in the same way, which is in excellent agree- 
ment with the expected value. This agreement shows that 
the real detected sources behave in much the same way as 
predicted by our completeness function, which was based 
purely on the synthetic sources. 

4. THE H I MASS FUNCTION 

4.1. Two Methods for Determining the Mass Function 

Given our results for the variation of signal strength 
within our search volume, and the probability of detect- 
ing sources of a particular signal strength and linewidth, 
determining the H I mass function is, in principle, a 
straightforward matter. We estimate the mass function 
using two well-known techniques: the "1/Vtot" method 
(see SSR) and the step-wise maximum likelihood (SWML, 
Efstathiou et al. 1988) method. 

These two techniques are complementary in several re- 
spects: 

(1) The SWML method is formulated to be indepen- 
dent of large scale structure effects. However, the SWML 
method assumes that the shape of the mass function is 
the same everywhere, which is particularly questionable 
for H I because of gas-stripping and merging in high den- 
sity regions. The l/Vtot method is simpler in concept and 
makes no prior assumptions about the uniformity of the 
mass-function shape. 

(2) With the 1/Vtot method, the overall normalization 
of the mass function is directly determined. Special tech- 
niques are used to normalize the results from the SWML 
method, which are not easily adapted to the complex po- 
sitional and distance dependencies in an H I survey. Note 
that for neither method does the normalization affect the 
shape. 

(3) The 1/Vtot method requires the detailed knowl- 
edge of survey sensitivity over the entire search volume 
as worked out in §2 and 3 to calculate the total volume 
(Vtot) hi which a source might have been detected. By 
contrast, the SWML method only requires that we find 
the maximum distance at which a source could have been 
detected at its detected position — this is simpler, since we 
can just scale the detected flux for distance and frequency 
dependencies. 

(4) We have further simplified the SWML calculation 
relative to the 1/Vtot method by assuming a sharp cutoff 
in sensitivity at 7 times the effective noise Af e f / predicted 
for a source of that linewidth. In part, this provides a test 
of whether our sensitivity "roll-off" might have an impor- 
tant effect on the shape of the mass function. 

The total volume or 1/Vtot method was originally pro- 
posed by Schmidt (1968). Vtot is the total volume within 
which a source could have been detected. If we take all the 
detected sources within a particular mass range summing 



up ^2 1 /Vtot gives us a direct estimate the number density 
of such sources. 

Because we have determined the detailed shape of the 
completeness function, we are able to more accurately es- 
timate the total effective search volume by weighting each 
position according to the probability of detecting a source 
there, as discussed in §3.2. The detectable volume for a 
galaxy is therefore: 

Vtot = y>(<r)/ / m C(l,S d et,o-)D 2 dDd(A5) (7) 

o=a u o 2 JA6=-4A>JD=v min /H 

where A(a) is the angular extent of the survey with the 
rms level o\ or a 2 , for cither single or double cover- 
age, and C is the completeness function described in §3. 
The distance variable implicitly incorporates the assumed 
rcdshift-distance relationship and the gain dependencies 
on frequency. The integration over distance ranges from a 
minimum velocity v m i n below which confusion with Galac- 
tic H I and high velocity clouds are likely to make source 
detection difficult to the maximum redshift covered by our 
spectra, 7977 km s _1 . Positional and frequency dependen- 
cies are carried implicitly within Sdet which varies with 
distance, declination offset (A<5), and frequency, as dis- 
cussed in §2. In practice, we carry out this integration by 
summing over small intervals and calculating the predicted 
value of Sdet at each position and velocity. 

The SWML method (Efstathiou et al. 1988) was de- 
signed to directly remove the effects of density variations 
caused by large-scale structure. It divides the mass func- 
tion into a series of bins and solves for the most likely set 
of relative weights for the bins. The likelihood function 
for each source is the ratio of the weight of its own mass 
bin to the sum of the weights of all of the mass bins in 
which a source, with the same redshift and limiting flux, 
could have been detected. By taking this ratio of weights, 
density effects are divided out. The total likelihood is the 
product of the likelihoods for all of the sources. A set 
of mass-bin weights that maximizes the total likelihood is 
found iteratively. 

By its design, the SWML method allows each source to 
have a different flux limit. The method is usually used 
in optical surveys where the limiting flux is uniform, but 
there is nothing in the design of the method or most im- 
plementations of it that requires this. We use a version of 
the method from Kochanek et al. (2001). 

The likelihood function in the SWML method assumes 
a sharp cutoff in the sensitivity. It could probably be 
adapted to include the completeness function, but as noted 
in the introduction to this section, we use a fixed "com- 
pleteness limit" for the SWML method rather than try- 
ing to modify the routine. We use only the 210 sources 
brighter than 7 times the effective noise N e ff- Because 
the completeness is not 100% at this limit, we will tend to 
slightly underestimate the number of sources near the flux 
limit. 

Because the H I source sensitivity varies over our search 
volume depending on a variety of parameters (§2), it is 
difficult to normalize the results from the SWML method. 
Instead, we scale the results to match the 1/Vtot results 
for high-mass (> 1O 9 M ) sources. The high mass sources 
were detectable over most of the potential search volume, 
so the uncertainties in their density are small. Note again 
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that this does not affect the calculated shape of the mass 
function derived by the SWML method. 

We show in the remainder of this section that the two 
methods display substantially similar H I mass functions. 
Since the two methods use different approaches, this pro- 
vides some reassurance that there is not a fundamental 
error in one of our approaches. Of course, both methods 
make many of the same assumptions about sensitivity, and 
we attempt to test those assumptions in various ways in 
subsequent sections. 

For the 1 / V to t calculation of the mass function we in- 
clude 233 galaxies, all of the galaxies confirmed in the 
Arecibo and VLA follow-up that were originally detected 
within 4.4' of the center of the main beam and that are 
> 6° from the center of Virgo. We recognize that this is 
a small sample of galaxies compared to optical surveys, 
however, we feel we have the best characterizations of HI 
survey sensitivity made to date, so our conclusions are as 
strong as the data allow. 

4.2. The H I Mass Function 

Figure 2 shows the H I mass function excluding the cen- 
tral 6° of Virgo based on the SWML method (filled circles) 
and 1/Vtot method (open squares). The curves in the fig- 
ure are Schechter (1976) functions: 

$(M) = — -— — — — = $*lnlO{M HI /M*) a+1 e- MH '/ M * 
d log M 

(8) 

where a is the power-law slope of the faint end, M* is the 
characteristic turn-over mass, and ln(10/e)$* is the den- 
sity per mass decade at M*. The minimum x 2 fit to the 
SWML points gives: a = -1.53, log(M*/M ) = 9.88, and 
$o = 0.005 Mpc~ 3 . (As noted earlier, the $o normaliza- 
tion value is ultimately based on scaling the SWML result 
to the 1/Vtot results.) The dashed line shows the mass 
function fit from Zwaan ct al. (1997) from their Arecibo 
survey: a = -1.2, log(M*/M ) = 9.80, and $ = 0.0059 
Mpc~ 3 (adjusted to H = 75 kms" 1 Mpc -1 ). 
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Fig. 2. — The H I mass function outside of the Virgo cluster core. 
The mass function is shown using both the 1/Vtot method (open 
squares) and the step-wise maximum likelihood method (solid cir- 
cles). The solid line is the best-fit Schechter function: a = —1.53, 
log(M*/M©) = 9.88, and <t>, = 0.005 Mpc~ 3 . The dashed line 
shows the Zwaan et al (1997) Schechter model fit to the AHISS 
data. Error bars indicate "1-cr" uncertainties; see text. 

The "l-cr" error bars in Fig. 2 for the 1/Vtot method are 
based on small number statistics (Gehrels 1986). The bins 



were selected by an automatic procedure that attempted to 
keep at least four galaxies in each bin unless the bin widths 
would otherwise become very small or large (Alog(M/M & ) 
> 0.4), except at the bright end where a single galaxy was 
allowed to define a bin. The error bars for the SWML 
method include an internal estimate of the uncertainty in 
the normalization and are highly correlated by the nature 
of the method. 

It is apparent from the figure that the two approaches 
yield very similar-shaped mass functions. It is also evi- 
dent that our mass function is significantly steeper than 
the a = —1.2 power law found by Zwaan et al. (1997). 
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Fig. 3. — The chi-square contours of the Schechter function fit to 
the SWML determination of the H I mass function (shown in Figure 
2). The contours represent 1, 2, and 3 a. 

The shallow a. — —1.2 power law is in good agreement 
with our results at high masses, but it becomes succes- 
sively worse at lower masses. Figure 3 shows the prob- 
able range of Schechter function parameters based on a 
chi-square goodness of fit to the SWML calculated mass 
function. Part of our difference from Zwaan et al. prob- 
ably comes from our better number statistics; they had 
detections of only 66 sources, and only 51 with A<5 < 4.4'. 
Our sample contains 7 sources with H I masses < 10 8 M Q 
as compared with 2 (for H = 75 kms -1 ) in Zwaan et al. 
(1997), 2 in Kilborn ct al. (1999), and 4 in Spitzak & 
Schneider (1998). 

We show in §4.3 that no significant changes arise when 
we re-examine our data set in a variety of ways. Large scale 
structure within our survey region also does not appear to 
bias our results (§4.4). However, there are indications of 
differences in the faint end slope when a survey is isolated 
to a cluster region (§4.5). 

4.3. Effects of Analysis Procedures 

There are many small choices in determining the H I 
mass function that might affect our results. We find, how- 
ever, that our result is robust. 
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Fig. 4. — Effects on the H I mass function of varying survey pa- 
rameters. The circles show the effect of using distances based on 
Vq instead of the flow-corrected model of Tonry ct al. (2000). The 
triangles show the result of narrowing the declination offset to 2'. 
The squares show how the l/Vtot results change if a correction is 
made for the large scale structure determined from optical data in 
the survey region (§5.1). The Schechter function curve shows our 
same fit as in the previous figure to aid comparisons. 

In Fig. 4, we show how the mass function is affected 
when we alter some of our basic assumptions. Circles in 
the figure show that only small changes result if we base 
distances on Vq, which makes only a simple correction for 
Local Group motion (de Vaucouleurs et al. 1977), instead 
of the Tonry et al. (2000) flow model. The use of V ap- 
pears to make the mass function slightly steeper, but not 
significantly so. This suggests that distance uncertainties 
are unlikely to radically affect our results. 

Triangles in the figure show the effect of limiting our 
coverage to galaxies with declination offsets smaller than 
Ad < 2' (triangles in figure). Within these smaller offsets, 
our sensitivity corrections (f(A5); see §2) are less than a 
factor of 2. 

Similarly minor changes to the H I mass function re- 
sulted when: (1) sources were detected in regions of single 
coverage or double coverage; (2) the minimum velocity 
cutoff for our volume calculations was changed from 200 
kms -1 to 100 or 300 kms -1 ; or (3) sources were detected 
with the 21 or 22 cm feed. We note that most of the low 
mass sources were detected with the 21 cm feed which had 
better sensitivity at small redshifts. This is another differ- 
ence from the Zwaan et al. (1997) data, which was based 
mostly on 22-cm-feed data. 

4.4. Simulation Tests and the Eddington Effect 

In order to test our methodology, we conducted detailed 
simulations of our entire procedure. We carried out these 
simulations to: (1) ascertain whether our completeness 
correction procedure had any unexpected effects; (2) ad- 
dress the possibility of biases that might have caused the 
l/Vtot to yield a different mass- function slope from an in- 
put population; and (3) determine the probable ranges for 
$ in our low-mass bins where we detect a small number of 
sources. 

Because H I surveys do not yet probe the extragalactic 
population very deeply, biases may be introduced due to 
small number statistics or from uncertainties in the dis- 
tances. Besides testing our procedures, we wanted to in- 
vestigate the possibility that distance uncertainties might 



bias the low-mass end of our derived function. For ex- 
ample, Schechter (1976) discusses a bias in the shape of 
the luminosity function due to high mass sources at larger 
distances having redshifts that erroneously imply they are 
nearby and low-mass. Alternatively, low-mass sources may 
appear to have higher masses, so the overall effect on the 
mass function is not obvious. 

Schechter showed that the "Eddington correction" for 
using redshift as a distance indicator can be quite large 
for faint objects observed at small redshifts where their 
distances are very uncertain. Even though we apply a 
flow correction model and exclude the core of Virgo from 
our derivation (which Schechter did not do) the residual 
uncertainty in distances will still most heavily influence 
the mass determination for low-mass sources since they 
are predominantly nearby. 

We generated samples of galaxies that obeyed an input 
Schechter function, gave them H I lincwidth properties 
that mimicked our detected galaxies, allowed for random 
inclinations, and located the galaxies randomly within our 
survey volume. Gaussian noise was added to each source 
and we simulated the detection steps, including effects for 
beam offset and frequency response, to determine which 
sources were detected. We also simulated all of the subse- 
quent steps, including remeasurement of the H I flux (with 
appropriate levels of noise inserted), calculation of Vtot for 
each source using the completeness roll-off described ear- 
lier, and finally calculated the mass function for each set 
of galaxies. 

To explore the Eddington correction, the distances we 
used in our mass-function calculation were based on the 
sources' redshifts, to which we added a Gaussian disper- 
sion of 300 km s . For sources outside the core of a 
cluster, this 1-a uncertainty of 4 Mpc should represent a 
conservative estimate of our distance uncertainty. If the 
uncertainty were any larger than this we would have ex- 
pected to detect some blueshifted sources outside of the 
Virgo core. 

Each simulation required ^60,000 galaxies (down to a 
mass of 10 6 - 5 M Q ), in order to "detect" ~233 galaxies, to 
match our observed sample size. We ran 1000 simula- 
tions each for an input Schechter function with a slope 
a = —1.53 and for a = —1.2, and used our same reduction 
programs to derive the mass function from the resulting 
set of detected sources. 

Figure 5 shows the median source density derived for 
each half-decade mass interval and 1- and 2-a (68% and 
95%) confidence intervals from the 1000 simulations. The 
dots show the median recovered value from a population 
of sources assumed to follow a Schecter function with (a) a 
= -1.53 and (b) a = -1.2. The simulations show no net off- 
set from the input mass function shape; they also provide 
some guidance for the size of errors that might be caused 
by distance uncertainties and the likelihood of detection 
of low mass sources below 10 M . 

It might at first appear surprising that we find no indica- 
tion of the Eddington correction. The effect does become 
visible when we increase the dispersion in redshifts to 600 
krns" 1 , a dispersion of 1000 kms -1 raised the slope of the 
mass function from an input value of a = —1.2 to a derived 
value of about —1.4. These are much larger distance errors 
than arc plausible for our sample, so we conclude that the 
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Eddington correction is not significant for the ADBS. 

The Eddington correction assumes, in effect, that more 
high-mass galaxies will be scattered into low mass bins 
than vice versa. While there is a larger volume from which 
to draw more massive sources that are Doppler shifted to 
a particular observed velocity, there is a larger space den- 
sity of low-mass sources in the nearer volume. The result, 
at least for plausible velocity dispersions, is that compa- 
rable numbers of sources are scattered up and down into 
neighboring mass bins. 
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Fig. 5. — Statistical and distance-error uncertainties in the mass 
function. The figure shows the range of mass function results recov- 
ered from 1000 simulations of the ADBS data. The dots show the 
median recovered value from a population of sources assumed to fol- 
low a Schechter function with (a) a = —1.53 or (b) a = —1.2, shown 
respectively by solid and dashed curves in the figures. The simula- 
tions assumed a velocity dispersion of 300 km s _1 , corresponding 
to distance uncertainties of ±4 Mpc. The heavy error bars show 
the ±1<7 (68.3%) range of results; the light error bars show the ±2<r 
(95.4%) range. For the lowest mass bins, some of the expected val- 
ues were zero, in which case these are marked as upper limits. 

Finally, we note that in the a = —1.2 simulation, detec- 
tion of ^233 sources resulted in an overall density that was 
~ 1.5x higher than the Zwaan et al. (1997) mass function 
would have predicted for our search volume. In the figure 
we have shifted the comparison mass functions (dashed 
and solid lines) upward so they pass through the simula- 
tion points. This difference is expected because our mass 
function results agree with Zwaan et al. at high masses, 
but we find a larger number of low-mass sources per unit 
volume than their mass function predicts. In other words, 
the Zwaan et al. mass function predicts that we should 
have detected fewer sources within our search volume than 
we did. 



4.5. Effects of the Completeness Function on the Mass 

Function 

For the application of the SWML method to our data, 
we use only sources with S b s /J\f e ff > 7, and we treat 
them as though they would become undetectable at a dis- 
tance where they would drop below this level. Thus we 
make no use of our completeness roll-off results, which is 
in keeping with the normal application of this method. 

For the 1/Vtot method, which has been used in all previ- 
ous H I mass function studies, we consider how the results 
might be affected by differences in the calculation of the 
total detection volume for each source. 

In contrast with our empirical method of determining 
the completeness, previous surveys have usually assumed 
that the effective noise depended on linewidth as w - 5 , and 
have claimed a step-function for the completeness behav- 
ior, often at S b s /N — 5. We discuss here how such as- 
sumptions are likely to have changed the estimate of the 
mass function relative to our analysis. 

We believe that the most critical problem with earlier 
surveys is not the details of the shape of the completeness 
function, but their basic assessment of where their sensi- 
tivity "cuts off." Some surveys have used their faintest 
detected sources as an indication of a sensitivity limit. 
Figure 1 shows that using the faintest detected sources to 
determine this limit might imply completeness to a much 
lower level than was actually achieved. The total poten- 
tial detection volume Vtot predicted for each source from 
such a low limit will then be overestimated, and the mass 
function correspondingly underestimated. 

The highest mass sources are bandpass limited, so ex- 
aggerating a survey's sensitivity does not affect their 
predicted detection volume much. 3 However, low mass 
sources may only be detectable to slightly beyond the mini- 
mum detection velocity, v m i n , which is determined by such 
things as confusion imposed by local Galactic high veloc- 
ity clouds (assumed to be 200 kms -1 here). If the maxi- 
mum detectable distance {v max / Hq) is overestimated, the 
predicted detection volume, Vtot °c (d^j — v^in) may 
increase more rapidly than even the cube of the distance- 
overestimation factor. Thus, when the sensitivity of an H I 
survey is exaggerated, the assumed detection volume will 
be overestimated more for low mass sources than high mass 
sources, causing the mass function to appear too shallow. 

Using the roll-off in completeness introduces only a small 
adjustment to our estimate of the total volume in which 
a source could have been detected. If we substitute a 
step-function at the point where we reach 50% complete- 
ness, the total search volume we estimate for each source 
is always within <20% of the value calculated from our 
completeness-function-based method, and overall has a 
negligible effect on the mass function. Likewise, using the 
linewidth dependence that we have established empirically 
only makes a small change from assuming a w 5 noise de- 
pendence. In fact, if we assume a w 0,5 noise dependence, 
the slope of our derived mass function becomes marginally 
steeper. 



3 The volume estimates are still somewhat affected in a driftscan survey like the ADBS or Zwaan et al. (1997) because even high-mass sources 
will be relatively weak at large declination offsets. 
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5. IS THE H I MASS FUNCTION AFFECTED BY LARGE 
SCALE STRUCTURE? 

The question labeling this section has two aspects: (1) 
Does large scale structure introduce a bias into our esti- 
mate of the H I mass function? (2) Are there variations in 
the shape of the mass function in different density regions? 
In §5.1 we show that large scale structure does not bias our 
HI mass function results. In §5.2 we show that the shape 
of the HI mass function, on the other hand, does vary in 
different density regions. 

5.1. Density Corrections 

Concerns about the effect of large scale structure within 
our search volume should be largely eliminated by the use 
of the SWML method. We have noted, though, that there 
are potential problems with this method if the shape of 
the mass function varies with the local density of galaxies. 

To understand possible effects of large scale structure 
we have used redshift data for optically cataloged galax- 
ies to conduct a check on the amount of density varia- 
tion (as a function of redshift) within our main survey 
region: 18.0 < S < 28.7° at all right ascensions and 
8.0 < S < 15.7° at < a < I6 h (see Paper 1). We 
used galaxies with photographic magnitudes brighter than 
m < 14.5 after correcting for Galactic extinction (Schlcgcl 
et al. 1999), drawn from the NASA/IPAC Extragalac- 
tic Database (NED). We examined the NED sources in 
narrow redshift ranges, and found that the shape of the 
optical luminosity function fit a Schechter function with: 
a = —1.05, M* = —19.9. After correcting the veloci- 
ties using the same method as for the H I measurements 
(Tonry et al. 2000), we derived the optical source density 
at each redshift by comparing the actual number counts 
to the predicted counts from the luminosity function. The 
resulting run of density with corrected velocity is shown 
in Fig. 6. 




cz (km s~') 

Fig. 6. — Density distribution of galaxies within the survey region 
based on optically selected galaxies. The solid line shows the mean 
density as a function of redshift for the full ADBS. The dashed line 
shows the density outside the core of Virgo. The dotted line shows 
the density within a 27° radius from the center of Virgo. 

Based on the optical sample of galaxies, we find that the 
density within the ADBS survey region (excluding galax- 
ies within 6° of the core of Virgo) is fairly uniform except 
within the interval 2000 < cz < 4000 kms -1 where the 
density is about half of the other regions (solid line in fig- 
ure). The higher density at other redshifts reflects (a) the 
local supercluster at small redshifts, (b) the Pisces-Perseus 
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supercluster at intermediate redshifts, and (c) the Coma 
cluster and associated "great wall" region at high redshifts. 
It is important to note that our survey region contains a 
mix of high and low density regions that average out to a 
fairly uniform density. 

In the slightly underpopulated interval of 2000 < cz < 
4000 kms -1 , the ADBS is primarily sensitive to galaxies 
with H I masses in the 8.5 < \og(MHi /Mq) < 9.5 range. 
The effect of the lower density will tend to suppress the 
counts in this portion of the mass function when using the 
1/Vtot method. We can attempt to make a density correc- 
tion to the l/Vtot mass function using the optically-based 
density. We modified the Vtot integral (§4.1), multiplying 
it by the density of sources at each velocity — essentially 
treating the higher density regions as having a higher de- 
tection probability (see also SSR). 

The effect of this correction is shown in Fig. 4 with 
square symbols. Again the changes to the mass function 
are minor, although there is a slight elevation of source 
counts in estimated density of sources around 1O 9 M as 
expected. After making these density corrections, the 
overall normalization of the mass function rises slightly 
to = 0.0058 Mpc~ 3 . 

Using optical estimations of the density to correct the 
H I-selected source counts is crude at best, but it gives 
us some indication of how the mass function shape and 
normalization might change. We expect that H I-selected 
sources will be less clustered than optically-selected galax- 
ies, since galaxies in higher-density regions may be de- 
pleted of gas, so the method is likely to over-correct any 
density problems. The lack of significant changes to the 
1 /Vtot mass function resulting from this density correction 
therefore indicates that the results are unlikely to be much 
affected by large-scale structure within the search region. 

We also note that the low-H I mass galaxies in the ADBS 
are at similar or higher velocities than the sources in the 
previous Arecibo H I surveys, and the density structure 
we find here is more uniform than in those surveys (see 
SSR). Therefore our density estimate for the lowest mass 
bin should be less subject to bias than those earlier sur- 
veys. 

5.2. Effects of Environment 

The Arecibo Dual-Beam Survey covers a large swath of 
sky that includes the Virgo Cluster. Within the Virgo 
region, a signficant overdensity of galaxies is evident out 
to about 27° from the center of the cluster (dotted line 
in Fig. 6). In our primary analysis we included galax- 
ies outside of the 6° core but within 27° of the center of 
Virgo. Including the galaxies in the outer regions of Virgo 
yielded a more uniform overall density, while still avoiding 
the large distance ambiguities associated with the core re- 
gion. This gave a mix of higher and lower density regions 
nearby, much as cluster and field regions also contribute to 
our results at higher redshifts. The region out to a radius 
of 27° from the center of Virgo has a mean overdensity 
of about a factor of 10. We examine here some possible 
changes in the character of the H I mass function within 
such a high density region. 

Our sample of Virgo galaxies is small — 38 galaxies 
within 27° of the center of the Virgo cluster, including 13 
galaxies within the inner 6°. More distant clusters are not 
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useful for probing the H I mass function in high density re- 
gions because they are too far away for the ADBS to have 
detected low mass sources. We caution that small num- 
ber statistics make our conclusions uncertain, but there 
are indications that the mass function is less steep in high 
density regions. 




Fig. 7. — The H I mass function within the Virgo cluster region. 
The solid circles show the SWML estimate for galaxies within 27° of 
the center of the cluster and with redshifts smaller than cz < 2300 
kms . The open squares show the 1/Vtot results over the same 
area. The solid and dashed curves are the same as in Figure 2. 
The data from the Arccibo Slice survey (Schneider et al. 1998) are 
shown as gray xs. These data match the shallower Schcchtcr fit 
where the data are being drawn from a cluster region (at the high 
mass end) and the steeper Schechter fit where they are being drawn 
from a lower density region (at the low mass end). 

We calculate the mass function using the same meth- 
ods as we described above. Both the SWML and 1/Vtot 
methods suggest a flatter mass distribution. This result 
was found whether we assigned all of the galaxies inside 
6° with redshifts smaller than cz < 2300 kms^ 1 to a fixed 
distance of 16.8 Mpc or used the solution from the Tonry 
et al. (2000) flow model. In the figure, the SWML results 
use the flow model distances while the 1/Vtot results are 
based on a fixed distance for galaxies in the core region. 
We applied the density correction described in §5.1 to the 
1/Vtot results. If we do not apply this correction, the nor- 
malization of our mass function would make the density 
higher than our earlier mass function at all masses, and 
about 10 x higher overall. Accounting for the factor of 10 
overdensity in the normalization allows us to more directly 
compare the results. 

The differences in the mass function with galaxy den- 
sity may help explain the difference between our current 
mass function (solid line) and our earlier estimate based on 
the Arecibo Slice of Spitzak & Schneider (1998) which is 
shown by gray x symbols in Figure 7. There is a problem 
with the distribution of galaxies in the Arecibo Slice (see 
Figure 8: all of the low mass and none of the high mass 
galaxies are located at low redshifts while the opposite is 
true at high redshifts. 

The relatively limited area covered in the Arecibo 
Slice was dominated by the Pisces-Perseus supercluster 
at higher redshifts, so the mass function for H I masses 
greater than ~ 10 8,5 is dominated by cluster galaxies. H I 
masses lower than ~ 10 8 ' 5 are drawn from lower density re- 
gions. The "turn up" that appears to occur at low masses 
in the Arccibo Slice is consistent with the higher mass 



points being drawn from a high density region and the 
low mass points from a low density region. The lowest 
mass points are consistent with a slope of a m —1.5 while 
the higher mass points, drawn from the higher density re- 
gions, are consistent with a slope of a = —1.2. The joining 
of these two nearly independent H I mass functions pro- 
duces the apparent "turn-up" in the Arecibo Slice H I mass 
function. 
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Fig. 8. — The redshifts of galaxies as a function of their H I mass 
in (bottom panel) the Arccibo Slice (Spitzak & Schneider 1998) and 
(top panel) the ADBS. 

By contrast, Figure 8 demonstrates that the ADBS cov- 
ered a large enough area that several high mass galaxies 
were detected at lower redshifts. The larger range of dis- 
tances over which high mass galaxies are detected helps to 
anchor the relative number density of high and low mass 
galaxies. 

6. LIMITS ON ULTRA-HIGH-MASS H I SOURCES 

The galaxy Malin 1, identified by Bothun et al. (1987), 
has been pointed to as evidence that we might be missing 
a significant population of high mass, low surface bright- 
ness galaxies (Disney et al. 1987). Large, H I-rich but low 
surface brightness galaxies are extremely difficult to detect 
with standard optical techniques. H I surveys that cover 
large enough volumes are ideal for identifying these galax- 
ies since they are as easy to detect as any other galaxy 
with a large H I mass. 

The ADBS data confirm that a large population of mas- 
sive H I-rich galaxies is not lurking just below the sky 
surface brightness limits. While our statistics at the high 
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mass end of the mass function are poor, we are sensitive to 
these galaxies in a large volume (~8x 10 5 Mpc 3 ) as seen 
by the fact that we detect high-mass sources with offsets 
as large as 12' from the center of the main beam. The 
mean fluxes of these galaxies are ~10% of their remea- 
sured values, but with our sensitivity limits, we should be 
able to detect galaxies in excess of 5 x 10 10 M Q at these 
large offsets out to the limiting redshift of our survey. 

With our ability to detect sources this far from the mid- 
line of the drift-scans, the declination strips nearly overlap 
(the strip separations alternate between 0.6° and 0.4°), 
which corresponds to a volume coverage of 8xl0 5 Mpc 3 
over an area of ~1 sr for these high mass systems. Nev- 
ertheless, we detect no high mass, low surface brightness 
galaxies. We do not worry about column density con- 
straints in quoting this number because even a Malin-1- 
like galaxy would not be resolved in most of the volume. 
For very nearby giants, we are only sensitive down to a 
column density > 2 x 10 19 cm~ 2 . 

In the nearby regime, the Zwaan et al. (1997) sur- 
vey places a much tighter constraint on the population 
of high mass, low surface brightness galaxies because it 
is sensitive to column densities > 2 x 10 18 cm~ 2 . These 
statistics restrict the population of Malin 1-like galaxies to 
< 5.5 x 1(T 6 Mpc" 3 . 

7. SUMMARY AND DISCUSSION 

We have used the ADBS to study the mean H I mass 
over a wide range of environments, and find a steep-sloped 
Schechter function: a = -1.53, = 9.88M©, = 
0.0058 Mpc~ 3 (the normalization is lower, $* = 0.0048 
Mpc -3 , when a density correction for large scale structure 
effects is not applied). The ADBS mass function results 
have been derived with very careful attention to the sen- 
sitivity function. We inserted "synthetic" sources which 
underwent all of the data processing procedures to allow 
us to derive the sensitivity as a function of H I line width 
and to derive the completeness function. Additionally, we 
find that our mass function results are robust to changes 
in an assortment of parameters such as the minimum dis- 
tance and velocity flow models, and are consistent whether 
we use the l/V tot or SWML method. 

Our mass function differs significantly from some previ- 
ous determinations, but this is probably because, in large 
part, previous H I mass functions were derived from opti- 
cally selected samples or from samples that surveyed high 
density regions. It appears likely that the mass function 
has a different shape in clusters and in the field. There 
is evidence from optical surveys that the luminosity func- 
tion evolves with time (Lin et al. 1999, Sawicki et al. 
1997) and density may also affect the shape of the mass 
and luminosity functions (Phillipps et al. 1998; Wilson 
et al. 1997). Gas stripping, evolution, and the merger 
rate of galaxies, which are more significant in higher den- 
sity environments, may preferentially remove gas from low 
mass systems or destroy them altogether. We find that the 
mass function in the Virgo cluster has a shallower faint- 
end slope, a = —1.2, similar to that found by Verheijen 
et al. (2000) in the Ursa Major region, which also has an 
overdensity of about a factor of 10. 

It is interesting to note that the changes in a we find 
with density may exhibit the opposite trend in optical sam- 



ples. Phillipps et al. (1998) find that the slope of the 
luminosity function gets steeper in the higher density re- 
gions. This difference emphasizes that galaxies' gas mass 
and optical luminosity arc not directly related. 
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Fig. 9. — The relationship between H I mass density (left-hand 
y-axis) or Vlju (right-hand y-axis) and H I mass. Most of the galaxy 
contribution to the H I mass density and Qhi is from M* galaxies, 
but the contribution from low mass sources is not negligible. The 
solid line fit to the histogram shows our Schechter function, a = 
-1.53. The dashed line shows the Zwaan et al. (1997) value, a = 
-1.2. The integral of this curve represents ~1% of Qi, that resides 
in the H I component of galaxies. 

Figure 9 shows the H I mass density of galaxies (phi) 
and the fraction of the critical density (pm / Pcrit) con- 
tained in H I-rich galaxies as a function of the galaxy's H I 
mass. This figure demonstrates that the mass contribution 
to (phi/ Pcrit) is largest from galaxies near M„. However, 
our steep-sloped mass function indicates that the contribu- 
tion from low mass sources is not negligible. The value of 
fi b inferred from D/H studies is 0.0445/i 2 5 (Buries & Tytler 
1998). Given this value, we find that ~1% (0.000484/i 2 . 5 ) of 
the baryonic mass is contained in the H I within galaxies. 
By contrast, Penton et al. (2000) have shown that ^20% 
of the baryons are tied up in low column density hydro- 
gen, primarily H II, observed in the Lyman-a forest. The 
relative contribution of the high and low density material 
suggests that H I-rich galaxies are relatively rare concen- 
trations of neutral gas embedded in a more substantial low 
density medium. 

The ADBS is one of the largest surveys to date, par- 
ticularly with respect to low mass sources, yet the statis- 
tics at the low mass end remain thin. Given practical 
limits for existing 21 cm telescopes, it seems likely that 
this will remain a problem for some time to come. It 
will therefore be vital for future surveys to carefully ac- 
count for the completeness function in their design. A 
particularly important question for future endeavors will 
be the relationship between the shape of the mass function 
and galaxy environment. To understand evolutionary pro- 
cesses in galaxies, we will have to establish how the mass 
function changes with environment and with time. 



The Digitized Sky Surveys were produced at the Space 
Telescope Science Institute under U.S. Government grant 
NAG W-2166. The images of these surveys are based 
on photographic data obtained using the Oschin Schmidt 
Telescope on Palomar Mountain and the UK Schmidt Tele- 
scope. The plates were processed into the present com- 
pressed digital form with the permission of these institu- 
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tions. Propulsion Laboratory, California Institute of Technology, 

This research has made use of the NASA/IPAC Extra- under contract with the National Aeronautics and Space 
galactic Database (NED) which is operated by the Jet Administration. 
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